Must diphone synthesis be so unnatural?

نویسندگان

  • William J. Barry
  • Claus Nielsen
  • Ove Andersen
چکیده

An English utterance was synthesized in four versions using sets of diphones produced under four different prosodic and contextual conditions. The synthesis used either accented diphones only or appropriately located accented and unaccented diphones, with each of these conditions being repeated using neutral-context and differentiated-context diphones. They were presented to two listener groups, a native English and a non-native group for paired comparison acceptability judgements. The results show a massive preference for the stressand context-differentiated condition. Both stress and context had a significant effect on acceptability judgements, but context-differentiation raised acceptability more strongly than stress-differentiation. Both the native and the main sub-group of non-native listeners judged the stimuli in essentially the same way.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A biphone constrained concatenation method for diphone synthesis

Diphone concatenation [1] has the advantages of simplicity and a relatively small database of speech when compared to other concatenative synthesis methods (e.g., [2]). However, diphone concatenation faces two notable problems. The first is coarticulation which extends beyond the scope of a single diphone and entails some degree of contextual mismatch for virtually any diphone in at least some ...

متن کامل

Segmentation and Labelling of Slovenian Diphone Inventories

Preparation, recording, segmentation and pitch labelling of Slovenian diphone inventories are described. A special user friendly intert'ace package was developed in order to facilitate these operations. As acquisition of a labelled diphone inventory or adaptation of a speech synthesis system to synthesise further voices is manually intensive, an automatic procedure is required. A speech recogni...

متن کامل

A Diphone Sharing Method Towards Scalable Unit-training-based TTS

One of the most popular applications of Text to Speech (TTS) is in embedded devices. The resource limitation of embedded device requires the footprint of TTS system to be very small. Toshiba TTS for embedded device is a unit-training-based system and uses diphone as basic unit. The trained diphone inventory occupies a large part of the footprint. This paper proposes a diphone sharing method to ...

متن کامل

Computational Phonology: Merged, not Mixed

Research into text-to-speech systems has become a rather important topic in the areas of linguistics and phonetics. Particularly for English, several text-to-speech systems have been established (cf. for' example llertz (1982), Klatt (1976)). For Dutch, text-to-speech systems are being developed at the University of Nijmegen (cf. Wester (1984)) and at the Universities of Utrecht and Leyden and ...

متن کامل

Synthesis and Control of Synthesis Using a Generalized Diphone Method

Generalized Diphone Control is a powerful means of building a musical phrase from dictionaries of analysed sound units by building sequences of units and concatenating and articulating them. ~rough a graphical user interface on Macintosh, the Diphone 2.0 software provides analysis, control and synthesis according to various models, such as the Sinusoidal Additive model and the Chant model. A la...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001